Waiting times for clumps of patterns and for structured motifs in random sequences

نویسندگان

  • Valery T. Stefanov
  • Stéphane Robin
  • Sophie Schbath
چکیده

This paper provides exact probability results for waiting times associated with occurrences of two types of motifs in a random sequence. First, we provide an explicit expression for the probability generating function of the interarrival time between two clumps of a pattern. It allows, in particular, to measure the quality of the Poisson approximation which is currently used for evaluation of the distribution of the number of clumps of a pattern. Second, we provide explicit expressions for the probability generating functions of both the waiting time until the first occurrence, and the interarrival time between consecutive occurrences, of a structured motif. Distributional results for structured motifs are of interest in genome analysis because such motifs are promoter candidates. As an application, we determine significant structured motifs in a data set of DNA regulatory sequences.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Norwegian Priority Setting in Practice – an Analysis of Waiting Time Patterns Across Medical Disciplines

Background Different strategies for addressing the challenge of prioritizing elective patients efficiently and fairly have been introduced in Norway. In the time period studied, there were three possible outcomes for elective patients that had been through the process of priority setting: (i) high priority with assigned individual maximum waiting time; (ii) low priority without a maximum waitin...

متن کامل

Functional motifs in Escherichia coli NC101

Escherichia coli (E. coli) bacteria can damage DNA of the gut lining cells and may encourage the development of colon cancer according to recent reports. Genetic switches are specific sequence motifs and many of them are drug targets. It is interesting to know motifs and their location in sequences. At the present study, Gibbs sampler algorithm was used in order to predict and find functional m...

متن کامل

تجلّی نقوش گرفت‌و‌گیر در فرش دوره صفوی

The hunting & animal patterns is indeed one of the most important & effective motifs which can be seen in Persian Art since ancient times. For a long period of time, these motifs (which include religious & old mythological concepts, and tell about the geographical & natural human being environment, as well as his own desires, his domineeringness, and struggle for survival), have been designed &...

متن کامل

مقایسه کارایی شاخص‌های تعیین الگوی پراکنش در درمنه زارهای استان یزد

Selection of efficient indices is very important for detecting and measuring random, uniform and clumped distribution patterns of plants in different plant communities. To compare and evaluate indices of dispersion patterns of plants, three stands were selected in Nodushan, Yazd. A (50m*100m) area was selected within each stand for sampling. Sampling was randomly systematicly conducted. Measure...

متن کامل

Approximation of word counts in Markov chains

In this talk, we give an overview about the diierent approximation results existing on the statistical distribution of word counts in a Markov chain. Results concerning the number of overlapping occurrences, the number of non-overlapping occurrences (renewals) and the declumped count will be presented. Counts of single words but also multiple words and word families are considered. We will see ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Discrete Applied Mathematics

دوره 155  شماره 

صفحات  -

تاریخ انتشار 2007